RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments

نویسندگان

  • Peter Henry
  • Michael Krainin
  • Evan Herbst
  • Xiaofeng Ren
  • Dieter Fox
چکیده

RGB-D cameras are novel sensing systems that capture RGB images along with per-pixel depth information. RGB-D cameras rely on either structured light patterns combined with stereo sensing [6,10] or time-of-flight laser sensing [1] to generate depth estimates that can be associated with RGB pixels. Very soon, small, high-quality RGB-D cameras developed for computer gaming and home entertainment applications will become available at cost below $100. In this paper we investigate how such cameras can be used in the context of robotics, specifically for building dense 3D maps of indoor environments. Such maps have applications in robot navigation, manipulation, semantic mapping, and telepresence. The robotics and computer vision communities have developed a variety of techniques for 3D mapping based on laser range scans [8, 11], stereo cameras [7], monocular cameras [3], and unsorted collections of photos [4]. While RGB-D cameras provide the opportunity to build 3D maps of unprecedented richness, they have drawbacks that make their application to 3D mapping difficult: They provide depth only up to a limited distance (typically less than 5m), depth values are much noisier than those provided by laser scanners, and their field of view (∼ 60◦) is far more constrained than that of specialized cameras or laser scanners typically used for 3D mapping (∼ 180◦). In our work, we use a camera developed by PrimeSense [10]. The key insights of this investigation are: first, that existing frame matching techniques are not sufficient to provide robust visual odometry with these cameras; second, that a tight integration of depth and color information can yield robust frame matching and loop closure detection; third, that building on best practice techniques in SLAM and computer graphics makes it possible to build and visualize accurate and extremely rich 3D maps with such cameras; and, fourth, that it will be feasible to build complete robot navigation and interaction systems solely based on cheap depth cameras.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RGB-D mapping: Using Kinect-style depth cameras for dense 3D modeling of indoor environments

RGB-D cameras (such as the Microsoft Kinect) are novel sensing systems that capture RGB images along with per-pixel depth information. In this paper we investigate how such cameras can be used for building dense 3D maps of indoor environments. Such maps have applications in robot navigation, manipulation, semantic mapping, and telepresence. We present RGB-D Mapping, a full 3D mapping system tha...

متن کامل

Real-time dense appearance-based SLAM for RGB-D sensors

In this work a direct dense approach is proposed for real-time RGB-D localisation and tracking. The direct RDB-D localisation approach is demonstrated on a low cost sensor which exploits projective IR light within indoor environments. This type of device has recently been the object of much interest and one advantage is that it provides dense 3D environment maps in real-time via embedded comput...

متن کامل

Interactive Dense 3D Modeling of Indoor Environments

The arrival of cheap consumer depth cameras, led by Microsoft’s Kinect system, presents a huge opportunity for 3D modeling of personal spaces. While 3D indoor mapping techniques are becoming increasingly robust, they are still too brittle to enable non-technical users to build consistent and complete maps of indoor environments. This is due to technical challenges such as limited lighting, occl...

متن کامل

Patch Volumes: Multiple Fusion Volumes for Consistent RGB-D Modeling

Recent advances have allowed for the creation of dense, accurate 3D maps of indoor environments using RGB-D cameras. Some techniques are able to create large-scale maps, while others focus on accurate details using GPU-accelerated volumetric representations. In this work we describe patch volumes, a novel multiple-volume representation which enables the creation of globally consistent maps of i...

متن کامل

Realtime Visual and Point Cloud SLAM

The availability of affordable RGB-D cameras like Microsoft Kinect can improve VSLAM applications, object 3D modeling and reconstruction of indoor environments, through the use of dense, synchronized depth and color images. The high frame rate of such devices isn’t exploited so far, since they require both fast and accurate algorithms for real-time registration. In this paper we present a techn...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010